Robust Reinforcement Learning with Relevance Vector Machines

نویسندگان

  • Minwoo Lee
  • Charles W. Anderson
چکیده

Function approximation methods, such as neural networks, radial basis functions, and support vector machines, have been used in reinforcement learning to deal with large state spaces. However, they can become unstable with changes in the samples state distributions and require many samples for good estimations of value functions. Recently, Bayesian approaches to reinforcement learning have shown advantages in the explorationexploitation tradeoff and in lower sampling costs. This paper proposes a novel reinforcement learning framework that uses the relevance vector machines (RVM) as a function approximator, which incrementally accumulates knowledge from experiences based on the sparseness of the RVM model. This gradual knowledge construction process increases the stability and robustness of reinforcement learning by preventing possible forgetting. In addition, RVM’s low sampling costs improve the learning speed. The approach is examined in the popular benchmark problems of pole-balancing and mountain car.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Reinforcement Learning framework Using Relevant Vector Machines

In this work we present an advanced Bayesian formulation to the task of control learning that employs the Relevance Vector Machines (RVM) generative model for describing value functions. The key aspect of the proposed method is the design of the discount return as a generalized linear model that constitutes a well-known probabilistic approach. This allows to augment the model with advantageous ...

متن کامل

Adaptive and Efficient Image Retrieval with One-Class Support Vector Machines for Inter-Query Learning

We present an extension of previous work on improving the initial image retrieval set by exploiting both intra and inter-query learning. In most Content-Based Image Retrieval (CBIR) systems based on Relevance Feedback (RF), all prior experience is lost whenever a user generates a new query, thus inter-query information is not used. In previous work, a system was developed that learns One-class ...

متن کامل

A Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels

The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...

متن کامل

On Robustness and Regularization of Structural Support Vector Machines

Previous analysis of binary support vector machines (SVMs) has demonstrated a deep connection between robustness to perturbations over uncertainty sets and regularization of the weights. In this paper, we explore the problem of learning robust models for structured prediction problems. We first formulate the problem of learning robust structural SVMs when there are perturbations in the sample s...

متن کامل

State generalization method with support vector machines in reinforcement learning

The conventional reinforcement learning assumes discrete state space. Therefore, it is necessary to make states discrete in order to handle continuous state environments. However, if a simple discretization is applied, the number of states increases exponentially with the dimension of the state space, and the learning time increases. In this paper, we propose a state generalization that is able...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016